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Introduction 


The  objectives  of  year  2  of  award  number  DAMD17-98-1-8143  were  to  express  the 
extracellular  metalloproteases  meprin  a  and  p  in  a  cultured  human  breast  cancer  cell  line.  A 
suggested  role  for  meprins  in  cancer  cells  is  based  on  observations  of  meprin  expression  in 
several  types  of  cancer  cells,  especially  colon  cancer,  and  on  the  in  vitro  proteolytic  activity 
of  these  enzymes.  We  have  shown  that  meprin  p  mRNA  is  present  in  many  types  of 
cultured  human  cancer  cell  lines,  including  breast,  colon  and  pancreas  (Matters  and  Bond, 
1999).  Meprin  a  protein  is  secreted  from  a  metastatic  colon  cancer  cell  line  (SW620)  but 
not  from  a  nonmetastatic  colon  cancer  cell  line  (SW480)  (Matters  and  Bond,  unpublished 
data).  Others  have  reported  that  human  meprin  a  is  secreted  from  Caco-2  colon  cancer 
cells  and  from  colorectal  tumors  (Lottaz  et  al.,  1999).  In  vitro,  meprin  a  and  p  can  degrade 
ECM  components  as  well  as  bioactive  peptides,  but  each  enzyme  has  unique  substrate 
specificities  (Bond  and  Beynon,  1995,  Bertenshaw  et  al.,  unpublished  data).  Therefore, 
our  hypothesis  is  that  the  proteolytic  activities  of  meprins  may  contribute  to  tumor  growth 
and/or  to  the  metastatic  potential  of  breast  cancer  cells. 

To  address  this  question,  constitutively  expressed  meprin  cDNA  clones  have  been 
stably  transfected  into  a  moderately  metastatic  breast  cancer  cell  line,  MDA-MB-231.  Once 
these  stable  transfectants  are  characterized,  the  human  breast  cancer  cells  overexpressing 
the  meprin  subunit  proteins,  and  vector  transfected  controls,  will  be  tested  in  vitro  for  their 
growth  properties  and  invasiveness,  and  also  injected  in  nude  mice  to  determine  their 
tumorigenicity  and  metastatic  behavior  (Objectives  for  years  3  and  4).  This  will  assess  the 
effect  of  meprin  overexpression  on  the  metastatic  potential  of  breast  cancer  cells  in  in  vitro 
and  m  vivo  systems.  These  experiments  will  address  whether  overexpression  of  meprin 
protein  affects  tumor  growth,  or  may  convert  a  moderately  metastatic  breast  cancer  cell  to  a 
highly  metastatic  cell.  Thus,  the  objectives  of  year  2  were  to  create  and  characterize  stably 
transfected  breast  cancer  cell  lines  which  overexpress  the  meprin  a  and  p  proteins. 
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Body  of  the  Report 


At  the  end  of  year  1  of  this  award,  full-length  cDNA  clones  encoding  the  meprin  a 
and  |3  subunits  in  the  constitutive  mammalian  expression  vector  pcDNA  3.1(+)(Invitrogen) 
had  been  made.  The  human  meprin  a  cDNA  was  stably  transfected  into  HEK  293  cells,  a 
human  kidney  cell  line  routinely  used  for  expressing  meprin.  Several  clones  expressing 
high  levels  of  the  protein  were  selected.  The  human  meprin  a  cDNA  was  also  transfected 
in  the  nonmetastatic  human  breast  cancer  cell  line  MCF7.  In  contrast,  when  the  human 
meprin  p  cDNA  was  transfected  into  HEK  293  cells,  no  stable  transfectants  or  transient 
transfectants  expressing  the  human  meprin  p  protein  could  be  detected.  This  result  is 
similar  to  the  problems  experienced  when  expressing  the  mouse  meprin  p  cDNA  in  HEK 
293  cells,  where  no  expression  of  the  mouse  enzyme  can  be  detected  either.  However, 
stable  HEK  293  clones  expressing  the  rat  form  of  the  meprin  p  protein  were  obtained  and 
characterized.  Thus,  all  further  work  on  meprin  p  expression  will  be  done  using  the  rat 
meprin  p  cDNA.  For  consistency,  a  rat  a  meprin  cDNA  clone  in  the  mammalian 
expression  vector  pcDNA  3.1(+)  was  also  constructed.  The  rat  meprin  a  cDNA  produced 
many  clones  with  high  levels  of  meprin  a  protein  expression  in  HEK  293  cells.  Therefore, 
both  meprin  a  and  p  cDNA  clones  that  reliably  produced  recombinant  protein  in  cultured 
cell  lines  were  available. 

At  the  beginning  of  year  2,  the  experimental  focus  was  on  characterizing  the  MCF7 
breast  cancer  cell  lines  which  had  been  transfected  with  a  human  meprin  a  cDNA.  Initial 
screening  of  meprin  a  transfected  MCF7  clones  by  RT-PCR  revealed  several  clones  which 
expressed  low  levels  of  the  human  meprin  a  mRNA.  These  clones  were  then  examined  for 
the  expression  of  meprin  a  protein  by  Western  blots.  No  meprin  a  protein  expression  was 
detected  in  any  of  the  MCF7  clones  which  had  showed  meprin  a  mRNA  expression.  In 
comparison  to  transfected  HEK  293  cells,  the  level  of  meprin  a  mRNA  expression  in  the 
MCF7  transfectants  was  considerably  lower,  thus  it  appears  that  the  level  of  meprin  protein 
in  the  MCF7  clones  may  have  been  below  the  level  of  detection  by  Western  blot. 
Alternatively,  the  meprin  a  protein  may  have  been  rapidly  turned  over  by  the  MCF7  cells. 
Rather  than  repeat  the  unsuccessful  MCF7  transfection,  another  human  breast  cancer  cell 
line,  MDA-MB-231,  was  chosen  to  continue  this  study. 

The  MDA-MB-231  breast  cancer  cell  line,  obtained  from  Dr.  Dan  Welch,  Jake 
Gittlen  Cancer  Center,  Penn  State  College  of  Medicine,  is  moderately  metastatic  in  nude 
mice  models  and  can  be  easily  transfected  using  lipid-based  transfection  reagents. 
Therefore,  this  cell  line  is  better  for  determining  whether  meprin  can  increase  invasive  or 
metastatic  characteristics  of  breast  cancer  cells  than  the  nonmetastatic  MCF7  cell  line 


previously  used.  In  addition,  a  more  highly  metastatic  breast  cancer  cell  line,  MDA-MB- 
435,  was  also  obtained  from  Dr.  Welch.  Both  these  breast  cancer  cell  lines  were  screened 
for  endogenous  expression  of  the  meprin  a  and  p  mRNAs  by  RT-PCR  and  for  meprin 
protein  by  Western  blot.  Western  blots  of  media  and  cell  membrane  fractions  were  probed 
with  anti-meprin  a  and  meprin  p  antibodies,  respectively.  No  detectable  meprin  protein 
was  found  in  either  the  MDA-MB-231  or  MDA-MB-435  cell  or  media  fractions.  However, 
RT-PCR  done  with  meprin  p  primers  did  detect  a  low  level  of  transcript  in  both  breast 
cancer  cell  lines.  As  with  the  MCF7  cells,  the  meprin  protein  may  be  at  a  level  too  low  to 
detect  or  rapidly  turned  over.  Another  possible  reason  for  the  lack  of  detection  of  the 
endogenous  human  meprin  protein  may  be  the  quality  of  the  meprin  antibodies  currently 
available,  none  of  which  were  generated  against  human  proteins.  Recombinant  human 
meprin  a  protein  has  recently  been  purified  (Han  and  Bond,  unpublished),  and  we  are 
using  this  protein  to  make  a  human  meprin  a  polyclonal  antibody.  A  polyclonal  rat  meprin 
P  antibody  is  also  being  produced  and  should  be  available  shortly. 

Because  the  rat  meprin  a  and  p  cDNAs  gave  consistent  expression  of  meprin 
protein  in  HEK  293  cells  (Year  1  of  this  award),  these  cDNAs  were  used  to  transfect 
MDA-MB-231  cells.  Using  Lipofectamine  2000  (Life  Technologies),  MDA-MB-231  cells 
were  transfected  with  the  rat  meprin  a  and  p  cDNAs  as  well  as  with  the  vector  plasmid  only 
(pcDNA  3.1+)  as  a  negative  control.  MDA-MB-231  clones  expressing  the  meprin  a  or 
meprin  p  cDNAs  were  obtained.  Clones  expressing  both  high  and  low  levels  of  meprin 
protein,  based  on  Western  blots  (Fig.  1),  were  selected  for  further  study.  The  availability 
of  MDA-MB-23 1  cells  expressing  a  range  of  meprin  protein,  from  high  levels  to  barely 
detectable  on  Western  blots,  will  prove  useful  in  m  vivo  studies.  We  will  be  able  to 
compare  MDA-MB-231  cells  expressing  different  amounts  of  meprin  for  their 
tumorigenicity  and  metastatic  potential  in  mice  models. 

MDA-MB-231  clones  expressing  high  levels  of  meprin  a  or  meprin  p  were 
fractionated  into  soluble  and  membrane  bound-proteins,  and  the  media  containing  secreted 
protein  was  also  collected  and  concentrated.  Membrane-bound  protein  was  released  by 
treatment  with  1%  octylglucoside,  and  the  presence  of  meprin  in  the  soluble,  membrane, 
and  media  protein  fractions  were  detected  by  Western  blots  (Objective  2,  Task  1).  In  the 
mouse  kidney  and  intestine,  as  in  HEK  293  cells,  the  meprin  a  subunit  protein  is  secreted 
from  the  cell  if  meprin  p  is  not  present,  while  the  meprin  p  subunit  protein  stays  anchored  at 
the  cell  surface  through  a  short  transmembrane  domain.  Most  of  the  meprin  p  subunit 
protein  is  extracellular,  and  through  covalent  and  noncovalent  ot/p  interactions,  the  meprin  a 
protein  can  associate  with  meprin  p  subunit  protein  and  be  maintained  at  the  cell  membrane. 
Fractionation  experiments  showed  that  the  transfected  MDA-MB-231  cells  also  secrete  the 
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meprin  a  subunit  protein  into  the  media  and  retain  the  meprin  p  subunit  protein  at  the  cell 
membrane.  Because  no  meprin  p  protein  is  present  on  the  meprin  a  transfected  MDA-MB- 
231  cells,  all  the  meprin  a  protein  was  secreted  instead  of  being  associated  with  the  cell 
membrane.  Vector  transfected  control  cells  showed  no  evidence  of  meprin  protein  on  the 
cell  membranes  or  in  the  media.  Experiments  are  currently  underway  to  make  a  double 
transfectant  of  MDA-MB-231  cells  with  both  the  meprin  a  and  p.  Expressing  both  meprin 
a  and  meprin  p  protein  will  test  whether  a/p  cell  surface  oligomers  will  be  formed  in  MDA- 
MB-231  cells  as  they  are  in  other  cultured  cell  systems. 

Most  extracellular  proteolytic  enzymes  are  regulated  in  part  by  their  secretion  in  a 
latent  or  proenzyme  form.  Activation  of  proteases  often  involves  the  removal  of  a 
propeptide  region  by  another  type  of  proteolytic  enzyme  at  or  close  to  the  cell  surface. 

Both  meprin  proteins  are  secreted  as  proenzymes  in  recombinant  expression  systems,  such 
as  HEK  293  cells.  However  there  is  preliminary  evidence  that  cancer  cells,  such  as  the 
SW620  colon  cancer  cell  line,  can  activate  meprin  a.  Therefore,  it  was  important  to 
determine  if  MDA-MB-231  cells  could  activate  the  meprin  a  or  p  proteins.  Meprin  proteins 
expressed  in  MDA-MB-231  cells  were  treated  with  a  mild  trypsin  solution  (25  ng/pl)  for  30 
minutes  at  room  temperature.  A  shift  in  the  size  of  the  meprin  protein,  as  detected  by 
Western  blots,  indicates  removal  of  the  propeptide  from  the  N-terminus.  Both  the  meprin 
a  and  p  proteins  showed  a  decrease  in  size  with  trypsin  treatment,  indicative  of  proenzyme 
activation  (Fig.  2).  Thus,  in  MDA-MB-231  cells  in  culture,  meprin  a  and  p  proteins  are 
secreted  as  inactive  forms.  In  addition,  this  also  implies  that  the  meprin  proteins  are  folded 
properly  and  are  stable.  Previous  work  in  our  lab  has  demonstrated  that  misfolded  meprin 
proteins,  such  as  mutant  proteins  with  truncations  or  deleted  domains,  are  susceptible  to 
complete  degradation  by  trypsin  (Tsukuba  and  Bond,  1998). 

Another  characteristic  of  meprins,  as  with  most  extracellular  proteases,  is  a  high 
degree  of  protein  glycosylation.  However,  cancer  cells  can  alter  the  glycosylation  patterns 
of  proteins.  The  deglycosylating  enzymes  EndoH,  which  removes  high  mannose  type 
glycosylation,  and  EndoF,  which  removes  all  N-linked  sugars,  were  used  to  analyze  the 
type  and  degree  of  glycosylation  on  the  meprin  proteins.  After  overnight  treatment  with  the 
deglycosylating  enzymes,  meprins  were  subjected  to  Western  blotting.  The  meprin  p 
protein  expressed  in  MDA-MB-231  cells  showed  a  pattern  of  deglycosylation  identical  to 
that  of  meprin  p  protein  expressed  in  HEK  293  cells.  No  high  mannose-type  sugars  were 
present  on  the  protein,  indicating  that  the  meprin  protein  was  complex  glycosylated,  and  the 
size  of  the  untreated  and  deglycosylated  proteins  was  identical,  indicating  that  the  degree  of 
meprin  glycosylation  in  the  different  cells  was  similar. 
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Key  Research  Accomplishments 

•  Screening  of  MDA-MB-23 1  and  MDA-MB-435  human  breast  cancer  cell  lines  for 
endogenous  expression  of  meprins. 

•  Production  of  stably  transfected  lines  of  MDA-MB-23 1  human  breast  cancer  cells 
expressing  either  the  meprin  a  or  p  protein. 

•  Characterization  of  the  recombinant  meprin  proteins  produced  by  MDA-MB-23 1  breast 
cancer  cells. 

Reportable  Outcomes 

1. )  Manuscripts  and  Abstracts: 

During  year  2, 1  was  a  co-author  on  a  paper  entitled  “Structure  of  the  mouse 
metalloprotease  meprin  p  gene  ( Meplb):  Alternative  splicing  in  cancer  cells”  by  W.  Jiang, 

J.  Kumar,  G.  Matters,  and  J.  Bond.  This  manuscript,  which  characterizes  the  mouse 
meprin  p  gene  ( Meplb )  and  describes  how  alternatively  spliced  exons  used  only  in  mouse 
cancer  cells  are  arranged,  is  in  press  in  the  journal  Gene.  In  April  2000, 1  was  invited  to 
attend  the  Gordon  Research  Conference  entitled  “Proteolytic  Enzymes  and  their  Inhibitors”, 
to  be  held  on  July  9- 1 3, 2000.  I  will  be  giving  a  poster  presentation  at  that  meeting. 

I  was  a  co-author  on  abstracts  for  posters  presented  at  the  two  conferences  by  a 
student  in  the  lab,  Greg  Bertenshaw.  The  first  was  at  the  conference  entitled  “Towards  an 
Understanding  of  Tolloid  Proteinases”  at  the  University  of  Manchester,  Manchester, 
England  in  May  1999,  and  the  second  was  at  the  IPS  (International  Proteolysis  Society) 
meeting  on  Mackinac  Island,  Michigan  in  September  1999.  Copies  of  the  abstracts  for 
these  posters  is  enclosed.  I  am  also  a  co-author  on  the  abstract  for  a  talk  entitled  “Meprins- 
Metzincins  with  unique  properties  and  expression  patterns”.  This  presentation  will  be 
made  by  my  mentor,  Dr.  Judith  Bond,  at  a  conference  entitled  “International  Symposium 
on  Proteases:  Basic  Aspects  and  Clinical  Relevance”  in  Montebello,  Quebec,  Canada  in 
June  2000. 

A  portion  of  my  work  was  described  at  research  seminars  given  by  Dr.  Bond  at  the 
Roswell  Park  Cancer  Institute  in  Buffalo,  N.Y.  (October,  1999)  and  at  the  Emory 
University  School  of  Medicine,  Atlanta,  GA  (April,  2000). 

2. )  Development  of  Cell  Lines: 

Two  stably  transfected  clones  of  the  human  breast  cancer  cell  line  MDA-MB-23 1, 
one  expressing  the  meprin  a  subunit  protein  and  one  expressing  the  meprin  p  subunit 
protein,  were  created.  These  cell  lines  will  be  crucial  to  upcoming  m  vivo  studies,  where 
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the  effects  of  meprin  overexpression  on  the  tumorigenicity  and  metastatic  behavior  of  breast 
cancer  cells  in  a  mouse  model  system  will  be  tested. 

3.)  Opportunities: 

Because  of  funding  of  award  number  DAMD17-98-1-8143, 1  applied  for  and,  in 
April  2000, 1  was  accepted  as  a  member  of  the  Penn  State  College  of  Medicine  Cancer 
Center  (see  attached  letter).  The  Penn  State  Cancer  Center  is  in  the  process  of  applying  for 
status  as  an  accredited  Cancer  Center  through  the  NCI.  This  procedure  is  in  the  planning 
stages,  and  I  will  be  participating  in  activities  related  to  the  Penn  State  Cancer  Center 
accreditation  process.  As  a  member  of  the  Penn  State  Cancer  Center,  I  will  have 
opportunities  to  establish  new  collaborations  with  clinical  and  basic  cancer  researchers  at 
this  institution  (The  Hershey  Medical  Center)  as  well  as  at  the  main  campus  of  Penn  State 
University  (University  Park). 

Conclusions 

Based  on  the  experiments  done  in  year  2  of  this  award,  the  expression  of 
recombinant  meprin  a  and  p  proteins  in  the  human  breast  cancer  cell  line  MDA-MB-231 
appears  to  be  very  similar  to  their  expression  in  HEK  293  cells.  The  meprin  proteins  are 
expressed  in  the  correct  cellular  compartment:  on  the  cell  membrane  for  meprin  p  and 
secreted  into  the  media  for  meprin  a.  Both  proteins  are  produced  as  trypsin-activable 
precursors  and  are  glycosylated  normally.  This  implies  that  these  proteins  are  synthesized, 
folded  and  processed  correctly  in  breast  cancer  cells.  It  is  likely,  therefore,  that  the  meprin 
protein  expressed  in  the  breast  cancer  cells  will  function  as  wild-type  meprin  protein  does. 
This  will  permit  the  remaining  portions  of  the  Objectives  of  this  award  to  proceed  as 
planned. 
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Appendices 

Figure  Legends: 

FIGURE  l .  Western  blot  screen  of  MDA-MB-231  clones  expressing  the  meprin  p  protein. 
Potential  meprin  p  expressing  cells  were  extracted  by  sonication,  the  membranes  were 
fractionated  by  ultracentrifugation  and  resuspended  in  1%  octylglucoside  in  20  raM  Tris, 
pH  7.5.  Membrane  proteins  were  separated  on  an  8%  acrylamide  gel,  blotted  to 
nitrocellulose,  and  probed  with  a  rat  meprin  p  antibody.  After  conjugation  to  an  HRP- 
linked  secondary  antibody,  meprin  bands  were  visualized  with  a  chemiluminescent  HRP 
substrate.  Out  of  the  8  potential  meprin  p  clones  (lanes  1-8),  6  clones  expressed  the  meprin 
protein  to  varying  degrees.  The  standard  (std)  is  a  membrane  protein  fraction  from  rat 
meprin  p  expressing  293  cells.  Protein  molecular  weight  markers  are  shown  at  the  left. 

FIGURE  2 .  Western  blot  analysis  of  the  trypsin  activation  of  the  meprin  a  and  p  proteins 
expressed  in  MDA-MB-231  (lanes  labelled  231)  and  HEK  293  (lanes  labelled  293)  cell 
lines.  Media  concentrates  (for  meprin  a)  and  total  membrane  proteins  (for  Meprin  p)  were 
incubated  with  trypsin  (25  ng/gl)  for  30  minutes  at  room  temperature,  which  will  remove 
the  prodomain  if  it  is  present.  Trypsin  treated  proteins  (+  lanes)  show  a  shift  in  molecular 
weight  compared  to  non-trypsin  treated  controls  (-  lanes).  The  change  in  the  size  of  the 
meprin  proteins  after  trypsin  treatment  is  consistent  with  removal  of  the  prodomain.  This 
indicates  that  the  meprin  proteins  are  not  activated  by  the  MDA-MB-23 1  cells  themselves. 
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39  Abstract 


40  The  mouse  meprin  p  gene  encodes  an  integral  membrane  protease  that  is  expressed  in  a  tissue-specific  manner  in  embryonic 

41  and  adult  epithelial  cells,  and  in  carcinoma  cells.  The  meprin  p  mRNA  in  the  embryo,  kidney  and  intestinal  cells  is  2  5  kb  whereas 

42  the  isoform  in  carcinoma  cells  (P'  mRNA)  is  2.7  kb.  The  work  herein  was  initiated  to  explore  the  molecular  mechanism  responsible 

43  for  the  different  isoforms.  Overlapping  fragments  containing  the  Meplb  gene  were  obtained  from  a  yeast  artificial  chromosome 

44  clone  using  polymerase  chain  reactions.  The  gene  spans  approximately  40  kb  and  consists  of  18  exons  and  17  introns.  The  first 

45  three  exons  are  unique  to  the  5'  end  of  P'  mRNA;  the  next  two  exons  correspond  to  the  5'  end  of  P  mRNA.  The  rest  of  the  exons 

46  (13  total)  encode  the  regions  common  to  both  p  and  p'  messages.  In  conjunction  with  the  cDNA  sequences,  the  gene  structure 

47  establishes  that  alternative  splicing  of  5'  exons  is  responsible  for  the  generation  of  the  mRNA  isoforms.  The  DNA  regions  between 

48  p  -  and  P-specific  exons  and  upstream  of  the  first  P'  exon  have  been  completely  sequenced  to  identify  potential  regulatory  elements 

49  for  P  and  p  transcription.  There  is  significant  homology  between  the  two  regions,  indicating  that  a  duplication  event  occurred 

50  during  evolution  of  the  Meplb  gene.  Potential  promoter  elements  and  transcription  factor-binding  sites  were  identified  from 

51  comparisons  to  sequences  in  the  databanks.  This  is  the  first  gene  structure  that  has  been  completed  for  meprin  subunits  from  all 

52  species.  The  work  elucidates  molecular  mechanisms  that  regulate  differential  expression  of  the  Meplb  gene.  ©  2000  Elsevier 

53  Science  B.V.  All  rights  reserved. 

54  Keywords:  Chromosome  18;  Differential  expression;  Exon/intron  organization'  Protease 

63  _ _ _ 


64 

65  1.  Introduction 

66  Meprins  are  members  of  the  ‘astacin  family^  and 

67  ‘metzincin  superfamily  ’  of  metalloendopeptidases  (Bond 
g  68  and  Beynon,  1995;  Stacker  et  ah,  1995).  All  the  members 

69  of  the  superfamily  contain  zinc  firmly  bound  at  the 

l  _ 

55  Abbreviations:  AM,  after  MATH;  bp,  base  pairs;  C,  cytoplasmic; 

56  EGF,  epidermal  growth  factor-like;  kb,  kilobases;  LPH,  lactase  phlori- 

57  zin  hydrolase;  MAM,  meprin,  A-5  protein  receptor  protein-tyrosine 

58  phosphatase  p;  MATH,  meprin  and  TRAF  homology;  nt,  nucleotide; 

59  PCR,  polymerase  chain  reaction;  RACE,  rapid  amplification  of  cDNA 

60  ends;  Sr,  sucrase-isomaltase;  TM,  transmembrane;  YAC,  yeast  artifi- 

61  cial  chromosome. 
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catalytic  center,  and  act  extracellularly.  Meprins  are  70 
highly  glycosylated,  disulfide-linked  oligomeric  prote-  71 
ases  composed  of  one  or  two  evolutionarily  related  72 
subunits,  a  and  p.  The  homo-  or  heterooligomers  that  73 
contain  meprin  a  subunits  are  referred  to  as  meprin  A  74 
(EC  3.4.24.18);  meprin  B  (EC  3.4.24.63)  is  a  homooligo-  75 
mer  of  P  subunits.  The  subunits  are  expressed  embryoni-  76 
cally  and  after  birth  in  a  strain-,  tissue-  and  cell-specific  77 
manner;  expression  is  particularly  abundant  in  proximal  78 
tubule  cells  of  mammalian  kidney  and  in  intestinal  79 
epithelial  cells  (Bond  and  Beynon,  1995).  80 

Meprin  subunits  are  also  expressed  in  a  number  of  81 
mouse  and  human  cancer  cells,  and  this  is  of  interest  82 
because  extracellular  proteases  are  capable  of  influencing  83 
the  course  of  growth  and  metastases  (Dietrich  et  al.,  84 
1996;  Lottaz  et  al.,  1999;  Matters  and  Bond,  1999b).  85 
The  5'  end  of  the  human  meprin  p  gene  (MEP1B)  has  86 
recently  been  analyzed,  and  a  PEA3  element  was  iden-  87 
tified  as  being  responsible  for  MEPIB  expression  in  88 
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89  cancer  cells  (Matters  arid  Bond,'  1999b).  Meprins  are 

90  capable  of  degrading  bioactive  peptides  such  as  bradyki- 

91  nin,  gastrin,  substance  P,  neurotensin,  and  TGF-a, 

92  peptide  hormones  such  as  luliberin,  parathyroid  hor- 

93  mone,  a-melanocyte-stimulating  hormone,  and  gluca- 

94  gon,  and  proteins  such  as  protein  kinases,  type  IV 

95  collagen,  laminin,  fibronectin,  and  gelatin  (Bond  and 

96  Beynon,  1995;  Chestukhin  et  al.,  1997). 

97  Meprins  contain  multidomain  subunits.  The  deduced 

98  amino  acid  sequences  of  the  a  or  P  subunits  from  mouse, 

99  rat,  and  human  are  75-90%  identical,  and  the  a  and  p 

100  subunits  from  the  same  species  are  approximately  50% 

101  identical.  The  predicted  domain  structure  of  the  P  sub- 

102  unit  (Fig.  1A)  consists  of  the  following:  S  (N-terminal 

103  signal  peptide),  P  (prosequence).  Protease  domain  (cata- 

104  lytic,  astacin-like),  MAM  (meprin,  A-5  protein,  receptor 

105  protein-tyrosine  phosphatase  p),  MATH  (meprin  and 

106  TRAF  homology),  AM  (after  MATH),  EGF  (epidermal 

107  growth  factor-like),  TM  (transmembrane)  and  C  (cyto- 

108  plasmic).  The  domain  structure  of  the  a  subunit  is 

109  similar  to  the  p  subunit  except  that  an  additional  I 
no  (inserted)  domain  is  present  in  a  between  the  AM  and 
ill  EGF  domains.  The  I  domain  is  essential  for  the 


C-terminal  proteolytic  cleavage  of  the  a  subunit  in  the  112 
endoplasmic  reticulum,  leading  to  the  secretion  of  the  a  113 
subunit  if  not  associated  with  the  P  subunit  at  the  cell  m 
surface  (Marchand  et  al.,  1995).  The  mouse  meprin  p  115 
subunit  remains  membrane-bound  during  biosynthesis,  116 
and  the  mature  subunit  is  localized  to  the  plasma  117 
membrane.  The  MAM,  MATH,  and  AM  domains  of  118 
meprin  subunits  are  essential  for  efficient  transport  of  119 
the  protein  to  the  cell  surface  and/or  correct  folding  to  120 
generate  enzymatically  active  proteases  (Tsukuba  and  121 
Bond,  1998).  122 

The  gene  encoding  the  meprin  P  subunit  exists  as  a  123 
single  copy  on  chromosome  18  of  the  mouse  and  human  124 
genomes  (Bond  et  al.,  1995).  When  two  mRNA  isoforms  125 
were  discovered,  the  2.5  kb  mRNA  (P)  in  the  embryo  128 
and  in  kidney  and  intestinal  cells  and  a  2.7  kb  mRNA  127 
(P')  in  mouse  carcinoma  cells,  it  was  proposed  that  128 
alternative  splicing  is  responsible  for  the  generation  of  129 
the  forms  in  both  mouse  and  human  cells  (Dietrich  130 
et  al.,  1996).  However,  recent  work  indicated  that  131 
alternative  splicing  was  not  involved  in  expression  of  132 
the  human  gene  (Matters  and  Bond,  1999a).  The  work  133 
herein  was  initiated  to  determine  the  exon-in tron  organ-  134 


12  Fig.  1.  Structure  of  the  mouse  meprin  p  subunit.  (A)  Protein  domain  structure  based  on  the  deduced  amino  acid  sequence.  The  predicted  functions 

13  for  the  domains  are:  S,  signal  peptide;  P,  prosequence;  Protease,  catalytic;  MAM  (meprin,  A-5  protein,  receptor  protein-tyrosine  phosphatase  p), 

14  MATH  (meprin  and  TRAF  homology)  and  AM  (after  MATH ),  adhesion  and  interaction;  EGF,  epidermal  growth  factor-like;  TM,  transmembrane; 

15  and  C,  cytosolic.  (B)  Exons  of  mRNA  isoforms  (p  and  p').  Exons  1-3'  are  unique  to  p'  and  exons  I  and  2  are  unique  to  p.  Exons  3-15  are 

16  common  for  both  isoforms.  (A)  and  (B)  are  drawn  to  the  same  scale.  (C)  Exon  and  intron  organization  of  the  gene.  Exon  numbers  are  indicated 

17  above  the  exons  (black  bars).  The  numbers  below  exons  indicate  the  beginning  and  end  of  the  exons  that  correspond  to  the  cDNA  sequences 

18  (Gorbea  et  al.,  1993;  Dietrich  et  al.,  1996).  The  introns  and  the  5'  end  are  represented  by  horizontal  lines.  (D)  Overlapping  PCR  clones.  Clones 

19  PI,  P12,  and  PI 5  were  obtained  directly  from  the  genomic  DNA.  All  other  clones  were  derived  from  YAC  M63G10.  (C)  and  (D)  are  drawn  to 
23  the  same  scale. 
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135  izatlon  for  the  mouse  meprin  p  subunit  gene  (. Meplb ), 

136  thereby  providing  the  structural  basis  for  differential 

137  expression  of  the  two  mRNA  isoforms  in  the  mouse  cells. 

138  2.  Materials  and  methods 

139  2.1.  YAC cloning 

140  Two  mouse  yeast  artificial  chromosome  (YAC) 

141  libraries  (Larin  et  a!.,  1991;  Chartier  et  al.,  1992)  were 

142  kindly  provided  by  Ellen  Brundage  and  Craig  Chinajult 

143  from  the  Cloning  Core  of  the  Human  Genome  Center 

144  at  Baylor  College  of  Medicine.  Multistep  polymerase 

145  chain  reaction  (PCR)  screening  of  the  YAC  libraries 

146  was  performed  with  two  gene-specific  primers, 

147  A G A CTCTGGCTT CTT CAT GC ATTT C  (nt  951-975) 

148  and  CACCACCCTGCTGGCCTGTAGTG  (nt  1141- 

149  1119)  (nucleotide  numbering  the  same  as  that  reported 

150  by  Gorbea  et  al.,  1993).  All  of  the  PCR  reactions  were 

151  carried  out  with  the  GeneAmp  PCR  system  9600 

152  (Perkin-Elmer).  The  library  was  amplified  in  20  pi  reac- 

153  tions  containing  10  pi  of  DNA  and  10  pi  of  mix  (final 

154  concentrations:  1  pM  each  primer,  0.2  mM  dNTP,  1  x 

155  PCR  buffer,  and  0.05  unit  of  AmpliTaq  polymerase). 

156  The  cycling  parameters  were  40  cycles  of  94°C/1  min, 

157  55°C/1  min,  and  72°C/1  min  after  the  initial  denatur¬ 
es  ation  (94°C/5  min).  Subsequent  appropriate  fractions 

159  were  amplified  as  above.  Positive  cells  (2  pi,  double 

160  row/double  column)  were  amplified  with  18  pi  of  mix 

161  containing  components  that  resulted  in  the  same  final 

162  concentrations  as  those  used  in  primary  and  secondary 

163  screening.  The  final  screening  was  performed  in  20  pi  of 

164  mix  with  one  half  of  the  isolated  colony  from  plates. 

165  The  PCR  products  from  primary,  secondary  and  final 

166  screening  were  detected  by  agarose  gel  electrophoresis 

167  followed  by  ethidium  bromide  staining.  The  PCR  pro- 

168  ducts  from  the  tertiary  screening  were  detected  by 

169  Southern  analysis. 

170  2.2 .  PCR  cloning 

171  Exon-specific  primers  were  synthesized  based  on  the 

172  cDNA  sequences  of  the  mouse  meprin  P  isoforms  and 

173  the  partial  gene  structure  of  the  mouse  meprin  a  subunit 

174  (Gorbea  et  al.,  1993;  Dietrich  et  al.,  1996;  Jiang  and 

175  Flannery,  1997).  Additional  gene-specific  primers  were 

176  synthesized,  based  on  newly  determined  intron  sequences 

177  (Table  1).  Gene-specific  primers  were  used  as  primers 

178  (1  pM  each)  in  PCR  with  either  the  genomic  DNA  or 

179  the  YAC  DNA  (4ng/pl)  as  template.  The  following 

180  cycling  parameters,  94°C/2.5  min,  30  cycles  of 

181  94°C/1  min-57.5°C/l .5  min-72°C/l.5  min,  and  72°C/ 

182  5  min,  were  used  to  clone  the  internal  regions  using  two 

183  gene-specific  primers.  In  order  to  obtain  the  5'  and  3' 

184  fragments,  a  PromoterFinder  DNA  Walking  kit 


(Clontech)  was  used.  In  the  primary  PCR,  five  genomic  185 
libraries  in  the  kit  were  amplified  with  gene-specific  186 
primers  and  API  through  seven  cycles  of  94°C/2  s  and  187 
70°C/3  min,  37  cycles  of  94°C/2  s  and  65°C/3  min,  and  188 
65°C/4  min.  The  resulting  products  were  diluted  50-fold  189 
and  amplified  with  nested  gene-specific  primers  and  AP2  190 
in  the  secondary  PCR  using  the  same  cycling  parameters  191 
as  those  used  in  the  primary  PCR  except  that  20  instead  192 
of  37  cycles  were  performed.  The  PCR  products  were  193 
separated  by  agarose  gel  electrophoresis  and  isolated  194 
using  the  GeneClean  procedure  (Biol  01 ).  The  fragments  195 
were  cloned  into  a  plasmid  vector  (pCRII  or  pCR2.1)  196 
using  the  TA  cloning  kit  (Invitrogen).  The  resulting  197 
clones  (P1-P15)  were  cleaved  by  restriction  enzymes  to  198 
determine  the  sizes  of  the  fragments  and  partially  199 
sequenced.  200 

2.3.  Genomic  Southern  201 

Genomic  DNA  (10  jig)  from  C57BL/6  mice  was  202 
digested  overnight  with  EcoRl,  Hindlll  or  Muni  (Life  203 
Technologies).  After  separation  on  a  0.6%  agarose  gel,  204 
the  DNA  was  transferred  to  a  Nytran  Plus  nylon  205 
membrane  (Schleicher  and  Schuell)  and  UV-cross-  206 
linked.  The  membranes  were  probed  overnight  with  207 
PCR-labeled  DNA  from  either  mouse  Meplb  exon  p'3'  208 
or  Meplb  exon  pi  at  42°C  in  5x  SSPE/50%  209 
formamide/5  x  Denhardts/1%  SDS/lOOpg/ml  of  her-  210 
ring  sperm  DNA.  Blots  were  washed  with  2x  211 
SSPE/0.1%  SDS  at  50°C,  0.2  x  SSPE/0.1%  SDS  at  212 
55°C,  and  0.1  x  SSPE/0.1%  SDS  at  55°C,  and  exposed  213 
to  Kodak  X-OMAT  Blue  film  for  3-7  days.  214 

2.4.  Sequencing  215 

The  ends  of  the  clones  were  sequenced  directly  using  216 
the  primers  (SP6,  M13  reverse  primer,  and  T7)  for  the  217 
vector  (pCRII  or  pCR2.1).  Internal  regions  were  218 
sequenced  by  either  generating  deletion  clones  with  219 
restriction  enzymes  or  using  gene-specific  primers  syn-  220 
thesized  based  on  the  determined  sequences.  Both  221 
manual  and  automated  sequencing  were  performed  on  222 
the  double-stranded  DNA.  Manual  sequencing  was  per-  223 
formed  with  Sequenase  2.0  (Amersham).  Automated  224 
sequencing  was  performed  with  AmpliTaq  DNA  poly-  225 
merase,  FS,  using  ABI  Prism  automated  DNA  sequencer  226 
(Perkin-Elmer)  in  the  Molecular  Genetics  Core  Facility  227 
of  the  Penn  State’s  College  of  Medicine.  Clones  PI  and  228 
P4  were  completely  sequenced.  Other  clones  were  par-  229 
tially  sequenced,  and  exon/intron  boundaries  were  230 
sequenced  on  both  strands.  231 

2.5.  Sequence  analyses  232 

The  determined  genomic  sequences  were  compared  233 
to  the  cDNA  sequences  of  p  and  P'  mRNA  to  identify  234 
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Table  1 

Oligonucleotide  sequences  of  the  primers* 
Clone  Size  Sense  Primer 


GTAATACGACTCACTATAGGGC(API) 

ACTATAGGGCACGCGTGGTCAP2 ) 

CTGGCTGGTCTCAACAAT 

CCTTCTCCCCTTTTCTTT 

AGCAGAAGCAGACACAGC 

ATGGATGCCCGGCATCAGCC 

CAAAGACATAGATGGAGG 

GGTCTGGACCTTTTTGAG 

GCCACATACCATTCCATA 

CGTGCATTGACTTCAAGCCTTGGT 

GGTCTTCAGTGGGAAACATTCATG 

GCACTACAGTAAAACCGCTTTCCA 

AGACTCTGGCTTCTTCATGCATTTC 

ACAAGGTGTCCTCACCATATC 

CTGCAGGAGAAGACTGGT 

CAGCTGGGAGGGATGTCAGGAATCTG 

CACGCCAGCAGGTGAAATGAAAAGAG 


Position  in 
cDNA 

Not  applicable 
Not  applicable 
14'-31' 
224-241' 
302-319' 

31-50 

114-131 

169-186 

246-253 

338-361 

407-430 

660-683 

951-975 

1306-1326 

1931-1948 

Intron  after  pl4 

2156-2180 


Antisense  Primer 


GAGCTCCAAACCAGCAGTGCTTCTTCC 

CAGGCGTTGGTTCGCCAAATTGTTG 

CGGGTTGAGAAATAATGG 

ATCATGTACATCCCGTCC 

GGAGAAATGTGGCAAAAA 

TGTCTTGGTCAATTCCTCCAT 

GAGTTTGATGTCTCCCTC 

GGCCATCTCTTGTGGTCT 

GCCCTTGAACACTGAGAT 

TCCAATGGACAACTCCTGCTTC 

ATGGTAGACTCTGTCCCG 

CGGGTGTAAATGTTCCAGTTG 

CCAGATATGGTGAGGACACCTTGT 

CTCACACCTTTTGCCCATGTA 

TTCTTCTATGATGGAAGGTCTCTTT 

GTAATACGACTCACTATAGGGC(AP1 ) 

ACTATAGGGCACGCGTGGKAP2) 


Position  in 
cDNA 

96-70' 

50-25' 

Intron  after  P'2' 
372-355' 

80-73 

145-125 

201-184 

248-231 

396-379 

456-435 

704-687 

1111-1093 

1329-1306 

1971-1951 

2199-2175 

Not  applicable 

Not  applicable 


158  *  Positions  of  the  primers  (nucleotide  number)  are  based  on  the  published  cDNA  sequences  (Gorbea  et  a!.,  1993;  Dietrich  et  al.,  1996).  API  and 

155  AP2  are  adaptor  primers  for  the  genomic  walking.  Clones  (P1-P15)  are  numbered  from  the  5'  end  of  the  gene.  PI,  P12,  and  PI  5  were  generated 

156  from  the  genomic  DNA;  other  clones  were  generated  from  the  YAC  DNA.  Sizes  of  the  clones  (kilobases)  were  estimated  from  the  restriction 
[jQ  158  analysis  of  the  plasmid  DNAjL 


235  boundaries  between  exons  and  introns.  Multiple 

236  sequences  were  aligned  using  Clustal  W  (Thompson 

237  et  al.,  1994).  Potential  promoter  regions  were  predicted 

238  using  the  NNPP  (Promoter  Prediction  by  Neural 

239  Network)  method  (http://www-hgc.lbl.gov/projects/ 

240  promoter.html).  The  basis  for  this  method  is  a  time- 

241  delayed  neural  network  that  consists  mainly  of  two 

242  feature  layers,  one  for  recognizing  the  TATA  box  and 

243  one  for  recognizing  the  ‘initiator’,  which  is  the  region 

244  spanning  the  transcription  start  site.  The  resulting  pre- 

245  diction  is  a  50  bp  region  with  the  transcription  start  site 

246  at  base  41  with  the  positional  accuracy  ±3  bp.  Potential 

247  transcription  factor-binding  sites  were  predicted  from 

248  the  TRANSFAC  database  (version  3.5)  using 

249  Matlnspector  (Quandt  et  al.,  1995). 


250  3.  Results  and  discussion 

251  3. 1.  Cloning  of  the  Meplb  gene 

252  To  begin  the  analysis  of  the  mouse  meprin  p  gene 

253  structure,  three  clones  (M324G12,  M66E3,  and 

254  M63G10)  were  isolated  from  53000  clones  by  multistep 

255  PCR  screening  of  two  YAC  libraries.  Southern  analysis 

256  of  the  three  clones  using  the  full-length  mouse  P  cDNA 

257  revealed  that  they  all  contained  the  mouse  Meplb  gene 

258  (data  not  shown).  Twelve  fragments,  sizes  ranging  from 

259  0.8  to  5  kb,  were  amplified  by  PCR  from  YAC  M63G10. 

260  The  resulting  plasmid  clones  were  designated  as  P2-P1 1, 

261  P 1 3,  and  P 1 4  (Table  1 ).  Clone  P 1 2  was  amplified  directly 


from  the  genomic  DNA  as  described  previously  262 
(Dietrich  et  al.,  1996).  Clones  PI  and  P15  were  generated  263 
by  genomic  walking.  Sequencing  showed  that  all  these  264 
clones  were  overlapping  and  constituted  a  continuous  265 
genomic  fragment  of  approximately  40  kb.  266 

3.2.  Exon/ intron  organization  of  the  Meplb  gene  267 

The  exon  and  intron  organization  of  the  Meplb  gene  268 
is  presented  with  the  domain  structure  of  meprin  p  269 
subunit  protein  and  two  mRNA  isoforms  (p  and  p')  270 
(Fig.  1).  There  is  no  correspondence  between  exons  and  271 
protein  domains  (Fig.  1A  and  B);  some  domains  (e.g.  272 
the  protease  domain)  are  encoded  by  several  exons,  and  273 
others  (e.g.  the  MATH  domain)  are  contained  within  274 
one  exon.  The  exon/intron  boundaries  of  the  Meplb  275 
gene  corresponding  to  the  protease  domain  are  con-  276 
served  in  the  Mepla  gene  encoding  the  mouse  meprin  a  277 
subunit  (Jiang  and  Flannery,  1997).  Fifteen  and  16  278 
exons  constitute  the  P  and  P'  mRNA,  respectively  279 
(Fig.  IB).  The  last  13  exons  are  common  to  both  p  and  280 
P'  mRNA.  The  P'-specific  exons  (PT  to  P'3')  precede  281 
the  P-specific  exons  (pi  and  p2).  This  type  of  organiza-  282 
tion  provides  evidence  for  the  proposition  that  pro-  283 
duction  of  the  p'  mRNA  isoform  involves  alternative  284 
splicing  of  the  p-specific  exons  from  the  mRNA  285 
precursor.  288 

Exon/intron  junctions  of  the  mouse  meprin  P  subunit  287 
gene  are  presented  in  Table  2.  All  the  introns  match  the  288 
‘gt-ag’  consensus  sequence  (Hor/witz  and  Krainer,  289 
1994).  The  sizes  of  the  introns  range  from  0.5  to  4.9  kb,  290 
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161  Table  2 

162  Exon/intron  junctions  ofthe  mouse  meprin  p  subunit  gene* 

163  /^Nucleotides  of  exons  and  introns  are  in  upper-  and  lower-case  letters,  respectively.  Corresponding  amino  acids  are  indicated  as  one-letter  codes 

165  j  below  the  first  nucleotides  of  the  respective  codons.  The  sizes  of  introns  (kb)  are  indicated.  Phases  of  introns  refer  to  position  of  an  intron  relative 

166  Ito  codon.  Phases  0,  I,  and  II  indicate  the  presence  of  an  intron  between  two  codons,  between  the  first  two  nucleotides  of  a  codon,  and  between 

167  Athe  last  two  nucleotides  of  a  codon,  respectively  (Patthy,  1987).  The  size  of  4.9  kb  includes  exons  pi  and  p2. 


r1COtrV3bC) 


Exon-Exon _ Splice  Donor _ 

6'1#-  GAGAAG  gttggtacgg 


Size  (kb) _ Splice  Acceptor 

4.7  cctcttcaag  ATACTG 


TATAAG  gtgtgttccc 


gtgacaccag  GTTTCC 


151-62 


11-12 


12-13 


13-14 


TTGGAG  gtaagctaaa 
G  V _ _ _ 

GGTTTG  gtaagaaaat 

G  L 

AGTTTG  gtaagtctat 
F  V _ 

ACCAAG  gtttgtggct 

Q  G 

CTCGAG  gtgagttgca 

L  E 

GCTTGG  gttagtacac 
L  E 

CAGTGG  gtaagttcga 
S  G  . 

AGCCAG  gtatgtttct 
P  G 

ACTGCA  gtatgtgatg 
C  T 

GCAAAG  gtaacaggtt 
K  D 

TAAAAG  gtacagtacc 
K  E 

CCAGTG  gttcgtggct 
S  D 

TTGAAG  gtatcgaaat 
E  D 

GTGCAA  gtgaggactc 
C  K 

GAAAAC  gtaagttgag 


gtaatttcag  TCAAAG 
_ K  D 

tgtttcccag  CCAGCT 

Jfetaattpag  TCAAAG 
_ K  D 

cttctcttag  GTTTGG 
L  G 

tttttgacag  GCAAAT 
A  N 

cttctgtcag  AAATGA 
M  N 

tctgtatcag  GTGCTG 

c  w 

atatattcag  GCAAGG 
K  E 

gcttttgtag  CTTCTT 
S  S 

ctctgagcag  ACTCTG 
S  G 

ctttctgcag  AGGTAC 
V  P 

tgcttttcag  ATAATG 
N  G 

ccctctgcag  ACATAT 
I  S 

cttcctgtag  GTGTCC 
C  P 

ctcctttcag  CAACAT 
_ Q  H 


291  as  compared  to  the  sizes  of  the  exons  from  19  (P2)  to 

292  444  (exon  1 1)  bp  (Fig.  1C).  Less  than  6 %  of  total  DNA 

293  (40  kb)  are  exonic  sequences.  The  majority  of  the  introns 

294  (1  1  out  of  15  that  are  present  in  the  coding  region)  are 

295  phase  I  introns,  and  there  are  three  phase  0  and  two 

296  phase  II  introns,  respectively  (Patthy,  1987). 

297  3.3.  Mouse  genomic  Southern 

298  Because  the  genomic  structure  of  the  Meplb  gene 

299  was  derived  from  PCR-amplified  YAC  clones,  a  genomic 

300  Southern  analysis  was  used  to  confirm  the  location  of 

301  the  (3'  exons  in  mouse  DNA.  Based  on  the  YAC 

302  sequences,  a  partial  restriction  map  of  the  first  three  P' 

303  exons  and  the  first  five  p  exons  was  made  and  used  to 

304  select  enzymes  for  cleaving  the  genomic  DNA  (Fig.  2A). 

305  As  predicted  from  the  map,  the  same  Hindlll  and  Muni „ 


fragments  were  detected  by  the  p'3'  exon  probe  and  by  306 
the  pi  exon  probe  (Fig.  2B).  This  indicates  that  the  two  307 
exons  are  adjacent  to  each  other  in  the  genomic  DNA.  308 
The  YAC  clone  would  also  predict  that  the  p'3'  exon  309 
probe  and  the  pi  exon  probe  would  hybridize  to  different  310 
EcoR  I  fragments.  The  different  £a?RI  fragments  31 1 
detected  by  the  P'3'  exon  probe  and  the  pi  exon  probe  312 
indicate  the  the  P'3'  exon  is  upstream  of  the  pi  exon.  313 
Taken  together,  the  genomic  Southern  results  confirm  314 
that  the  P'  exons  are  not  artifacts  of  YAC  cloning  or  315 
PCR  amplification  and  exist  in  the  mouse  genome  in  316 
the  same  location  as  in  the  YAC  cloned  DNA.  317 

3.4.  Potential  promoter  elements  for  the  ft  isoform  318 

The  1.6  kb  sequence  including  the  region  immediately  319 
upstream  of  the  first  p-specific  exon  was  determined  320 
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Fig.  2.  Mouse  genomic  Southern.  (A)  Schematic  of  the  intron-exon  organization  of  the  5'  end  of  the  mouse  Meplb  gene.  Boxes  represent  the 
exons,  and  restriction  sites  are  indicated  by  the  vertical  lines.  Abbreviations  for  the  restriction  sites:  RI  =£coRI,  M  =  A/w/ri,  H3=//mdIII.  (B) 
Genomic  Southern  blots  of  C57BL/6  mouse  DNA  digested  with  £a?RI,  Hindlll  or  Muni.  Molecular-weight  markers  (kbp)  are  listed  at  the  left. 
The  membrane  on  the  left  was  hybridized  with  a  probe  to  Meplb  exon  p'3',  while  the  membrane  on  the  right  was  hybridized  with  a  probe  to 
Meplb  exon  pi. 


321  from  clone  P4  because  this  region  may  contain  potential 

322  promoter  and  other  regulatory  elements  for  transcrip- 

323  tion  of  meprin  (3  mRNA  in  kidney  and  intestinal  cells 

324  (Fig.  3).  Three  potential  promoters  were  predicted  for 

325  p  using  the  NNPP  method  with  the  score  cut  off  of  0.8. 

326  They  are  located  in  the  regions  of  nt  502-551,  632-681, 

327  and  1492-1541  (indicated  by  dotted  lines)  with  the  score 

328  of  0.92,  0.99,  and  0.95,  respectively.  The  transcription 

329  start  site  of  the  last  promoter  region  (adenosine,  A, 

330  double-underlined)  coincides  with  the  5'  end  of  cloned 

331  p  cDNA  obtained  by  the  5'  RACE  procedure  (rapid 

332  amplification  of  cDNA  ends)  (Gorbea  et  al.,  1993), 

333  consistent  with  the  proposition  that  this  is  the  site  for 

334  initiation  of  transcription.  Therefore,  the  nucleotide  A 

335  is  designated  as  the  first  base  in  the  pi  exon. 

336  Many  potential  transcription  factor-binding  sites 

337  were  predicted  in  the  1.5  kb  sequence  upstream  of  the 

338  pi  exon  from  the  TRANS  FAC  database  (version  3.5) 

339  using  Matlnspector  (Quandt  et  ah,  1995).  Fig.  3  indi- 

340  cates  the  sites  that  matched  the  consensus  sequences 

341  with  100%  core  similarity  and  at  least  95%  matrix 

342  similarity.  Transcription  factors  HFH-3,  CREB  and 

343  GATA-1  are  strong  candidates  for  regulating  kidney 

344  and  intestinal  expression  of  the  Meplb  gene  based  on 

345  the  following  observations.  HFH-3  is  a  winged  helix 


transcriptional  activator  expressed  in  the  distal  tubules  346 
of  embryonic  and  adult  mouse  kidney,  and  the  HFH3  347 
site  is  found  in  several  kidney-specific  genes  such  as  348 
Na/K-ATPase  and  E-cadherin  and  transcription  factors  349 
such  as  HNF-1  and  HNF-4  known  to  regulate  gene  350 
expression  in  kidney  and  intestine  (Traber  and  Silberg,  351 
1996;  Overdier  et  ah,  1997).  CREB  (cAMP  response  352 
element-binding  protein)  interacts  with  an  intestinal  353 
homeodomain  transcription  factor  Cdx2  and  enhances  354 
Cdx2-dependent  transcriptional  activity  (Lorentz  et  ah,  355 
1999).  Several  members  of  GATA-binding  proteins  are  356 
implicated  in  gene  regulation  in  intestine  (Laverriere  357 
et  ah,  1994).  358 

There  are  also  factors  upstream  of  the  pi  exon  that  359 
may  be  important  for  developmental  regulation  of  the  360 
Meplb  gene.  For  example,  the  murine  S8  homeobox  361 
gene  is  expressed  in  a  mesenchyme-specific  pattern  in  362 
embryos  and  in  regions  involved  in  epithelio-mesenchy-  363 
mal  interactions  (de  Jong  et  ah,  1993).  Brn-2  is  a  364 
developmental  regulator  containing  a  POU  domain  (Li  365 
et  ah,  1993).  Other  potential  transcription  factors  recog-  366 
nizing  the  sites  indicated  in  Fig.  3  have  been  shown  to  367 
regulate  a  variety  of  genes.  For  examples,  Sox-5  is  a  368 
novel  murine  gene  related  to  SRY,  the  testis-determining  369 
gene,  and  highly  expressed  during  spermatogenesis -;37Q 
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AGCAGAAGCA  GACACAGCTT  CAAATGCAGG  ATGAAGCTCC  TGAAGGCTCC  CAGGGACGGG  ATGTACATGA  TGACATTTGG  AOGTAAGCTA  AACCCCTCCC  100 

MZF1 

CTCAGGTTGT  GTTCCTGTAG  GAGGATTTCA  CTGAAACCCA  GGACAGAAGA  reATAGAGAC  GTCATCCATC  CAGCCCAGGC  AGAATGCAGC  TTTTGCAGTT  200 

CREB/TCP11 

GGATTCAGCT  GTGCTTCGTG  AACTTTCTAT  GAGACTCCTA  CTCATACTTG  GGTCATACCC  TATAATTAGA  QTGGAAAATA  GACTCTCCTG  TCCCAGGAAC  300 
HflPl  S8  NPAT 

tgctgcttcc  tgctctttct  gcttctcttt  tcctactttc  aaatgatagc  tgtctgtcct  aatagataga  GTGCAGTTAG  GTGTTTTAGG  CACACATTGC  400 

HTXT  LM02C0M 

AGCTTGGAAA  GCCAGATGTT  CATAATTTGA  TCTTCCTCAC  TAAAAGATCT  AACTTTATCT.JTCCTTTATTT  CTAAAAGAAG  CtTTTCTTTCT  TTSTATTTGA  500 

LM02C0M  PKH40/HPH3 

CTG  P.T.  G TAT ATATgG  „  SPATATAJS&.  .C  ATATGT  ACC  .  T^TGC  CTGC.  AGAGGCC AG A  AGAGGGCATC  AGAGTCTCTG  GAATTCGAAT  CACAGACACC  600 

TATGATCTTC  TGTGTGGGTA  TTAAGG ATGG  CGCCTGGGTC _ CTTTAAAGAT. .GGGCC ACTGC , TCTTCATGGC, . TTCC AGCCTC  CCTTTATTTT  700 

BRN2  s 

TCTTTAAGAg  TCATTATTAG  TTCC ATGG AG  GTAGTAGAAA  TTTCACTGCA  GCCAATGAGG  GGCAAGCTAT  TTTTGTGTAT  GTGTTGCTGG  AGATCAAATG  800 

TCF11  HSP1  CAAT/NFY 

CGGGGCCTTG  TAAATGTAAG  GCAAGGACTC  TGCTGCTGAA  GTTTATCTCC  AACACCGAAT  AAATGATTCT  GTGCCCAAGT  TTAAAGTCCA  GAGGTGTTGG  900 


CAGGGCCACA  TATGATCCCT  GATGTTGTCT  CTCCTCAGGC 


TTAGTTACCA  TCCTAGGTAG  TCTCATTATG  GAAATATTCT  1000 


TATTTCCACG  TGTGTGACCC  TTCAGGGTGG  GTTCATTCTC  TGGCATGTAT  GGATTACTTT  TGGCCTCACA  GGAGTTATTC  TCTTCCTTCC  TAATACCCTC  1100 
USP  /  TCFl  1* 

TTGACTCTGT  ATTGTAGCTT  CTCATGGCCC  TAGTCAGGAA  ACAGCTATTT  GATTCCGGAT  TTAATAAACT  ATTATTAGAG  TCTGAGGAAG  ACAGTGAGAG  1200 

GTTGGTTGAT  CCACATACTC  TGTGCTTGTG  TGTGTGTGTG  TGTGTGTGTA  CACATACATG  TATGTTTCTT  TTTGCTQftAA  CAATGXTAGT  AATGTTGOQA  1300 

Sox5  IK2 

AGCAATATGT  ATACATCAGA  GAATATGCCT  CAGTGACAAG  TTTTAGTTGT  TCAATGTTGG  CACAGTAAAG  AGGGGGAAAA  AGAGGTGTGT  CAGCAACCCT  1400 

GGACTTTAGC  TTCTTGATTG  GAAGTTACTA  TAGCAGCAAG  ATTACTTQAC  AQATAGATCA  TTAACATTAA  GATCAAAGGC  CGGMGXXAT  GATGTTTAAC  1500 

GXTA1  CBTS1P54 

TTTTAAAAGT  CAGCTCTGCC. ATGACT^CA.^GCTTOCAG. CTTTCGTCTG  GAAGCCACAG  T&XQGATGCC  CGGCATCAGC  CTTGGTTTCT  GGTTTTTGCC  1600 

T " 

ACATTTCTCC  1610 

38 

44  Fig.  3.  Mouse  Meplb  sequence  between  exons  P'3'  and  pi.  The  complete  sequence  of  clone  P4  (Fig.  ID)  is  numbered  on  the  right.  The  sequences 

45  of  the  primers  used  to  generate  this  clone  correspond  to  the  first  and  last  18  nucleotides,  respectively  (Table  1 ).  The  partial  sequences  of  exons  p'3' 

46  (1-82)  and  pi  (1532-1610)  are  italicized,  and  the  translation  start  codon  for  the  p  protein  is  underlined.  The  end  of  exon  p'3'  and  the  start  of 

47  exon  pi  are  marked  by  vertical  bars.  Potential  promoter  regions  are  indicated  by  dotted  lines  with  transcription  start  sites  double-underlined. 

48  Potential  transcription  factor-binding  sites  are  indicated.  The  sequence  has  been  deposited  in  GenBank  with  Accession  No.  AF1 60982. 


371  (Denny  et  al.,  1992).  MZF1  and  Lmo2  play  a  role  in 

372  hematopoiesis  (e.g.  Perrotti  et  al.}  1995).  IK-2  is  one  of 

373  the  zinc  finger  DNA-binding  proteins  encoded  by  the 

374  lymphocyte-restricted  Ikaros  gene,  the  master  regulator 

375  during  lymphocyte  development  (Molnar  and 

376  Georgopoulos,  1994).  In  contrast,  TCFl  1  is  more  widely 

377  expressed,  and  HSF-1  is  a  member  of  the  heat-shock 

378  transcription  factors  known  to  function  in  the  cellular 

379  stress  response  (e.g.  Johnsen  et  al.,  1998).  The  oncoprot- 

380  ein  c-Ets-1  (p54)  that  binds  the  CETS1 P54  site  is  known 

381  to  activate  the  promoter  of  a  matrix  metalloproteinase 

382  stromelysin  (Wasylyk  et  al.,  1991).  The  NFY-binding 

383  protein,  NF-Y,  is  the  major  CCAAT  box  recognizing 

384  protein  that  may  serve  different  roles  in  TATA-contain- 

385  ing  and  TATA-less  promoters  (Mantovani,  1998). 

386  3.5.  Comparison  of  the  5'  upstream  sequences  of  jS  and  ft 

387  and  potential  promoter  elements  for  the  ft  form 

388  The  0.5  kb  sequence  containing  the  region  immedi- 

389  ately  upstream  of  the  first  (3 '-specific  exon  was  deter- 

390  mined  from  clone  PI.  Fig.  4  indicates  potential 


transcription  factor-binding  sites  in  the  P'  upstream  391 
region  that  matched  the  consensus  sequences  in  the  392 
TRANSFAC  database  (version  3.5)  with  100%  core  393 
similarity  and  at  least  95%  matrix  similarity  using  394 
Matlnspector  (Quandt  et  al.,  1995).  Four  of  the  sites,  395 
HSF1,  GATA1,  S8  and  IK2,  are  also  present  in  the  p  396 
upstream  region,  but  their  positions  are  not  conserved.  397 
The  GATA 1  site  is  also  present  in  the  promoter  region  398 
of  the  human  gene  encoding  matrix  metalloproteinase  399 
matrilysin  (Wilson  and  Matrisian,  1998).  Two  sites,  400 
TATA  and  CDPCR3HD,  are  unique  to  the  P'  upstream  401 
region.  The  CDPCR3HD  site  is  recognized  by  a  Cut-  402 
like  protein  that  belongs  to  a  distinct  class  of  homeodo-  403 
main  proteins  with  multiple  DNA-binding  domains  and  404 
acts  as  a  negative  regulator  of  gene  expression  (Harada  405 
et  al.,  1995).  406 

The  sequence  consisting  of  the  upstream  region  and  407 
the  p'-specific  exons  is  42%  identical  to  the  p  sequence  408 
in  more  than  900  bp  (Fig.  4).  The  degree  of  homology  409 
between  the  two  different  regions  of  the  Meplb  gene  410 
indicates  that  a  duplication  event  occurred  during  the  411 
evolution  of  this  gene.  Following, the  duplication  event,  412 


1 

2  . . .  Gene  12415  —  CAP  Auto  Page  Proofs— Page -7 

1  agmgen2415  28-03-:0  16:38:10  RevT2.T3x  -  :  •  --  -  “  -  _ 

3  "  * 


8 


2 


W.  Jiang  et  ai  /  Gene  00  (2000)  000-000 


S'  .AGTACTCTCT  GCAGGCAAGC  TCTT--CTCT  TGCAGGGAAG  GTGCCCAGAT  ATCTGGTGTT  CGAACCTGCC  TCCTGGCAGA  AGCTGTGTTC  CATTCACCAG  98 
S  GTGTGGGTAT  TAAGGATGGC  GCCTGGGTCC  TTTAAAGATG  GGCCACTGCT  CT-TGATGGC  CGTGTCTTCT  TTCCAGCCTC  CCTTTATTTT  TCTTTAAGAG  710 

*  *  ***  **  *  *  **  *  *  **  *  *  *  *  *  *  *****  **  ***  *  *  *  *  *  *  *  ****** 


HSF1 

S'  AGGTCTTAAG  ATCCTGTGGA  GGGTCCTGTG  AGGACCTTGG  GGATGTCCAG  AAACTTCTTG  CGCTAGGAAC  CCTGGTGCTG  GAGTGGACCA  GAAGGGACTT  198 
15  TCATTATTAG  TTCCA-TGGA  GGTAGTAGAA  ATTTCACTGC  AGCCAATGAG  GGGCAAGCTA  TTTTTGTGTA  TGTGTTGCTG  GAGATCAAAT  GCGGGGCCTT  809 

*  *  **  ***  ****  **  *  *  **  **  *  *  *  *  **  *****  ***  *  *  ***  *** 

GATA1  88  CDPCR3HD 

S'  GTGCCCCTGA  TCAGGCCGTG  TTATCTGCTT  CTTTAATTAA  TCCAGTCTCA  GGTCCCGATC  AATTGGATTG  GAGCAGATGC  TGTGTTCTAC  TCACCAGAGG  298 

S  GTAAATGTAA - GGCAAG  GACTCTGCTG  CTGAAGTTTA  TC - TCC  AACACCGAAT  AAATGATTCT  GTGCCCAAGT  T - TAAAG  TCCAGAGGTG  896 

**  *  *  *  *  *  ******  **  *  **  *  **  **  ****  **  **  *  *  **  *  *  *  *  *  **  **  * 


S'  TCTTAAGATC  CCGTGGGGGA  TCCTG - T-  GTGAGTCCTT  GTGGGTGTCC  GGAGACTCTG  AGGGCAAGGA  ACCCTGGT-G  CTGGAGT G--  GACGGGAAGG  391 

S  TTGGCA GGGC  CACATATGAT  CCCTGATGTT  GTCTCTCCTC  AGGCATCCTT  GGAAAATTAT  TCCTTTAGTT  ACCATCCTAG  GTAGTCTCAT  TATGGAAATA  996 
*  **  *  *  *  ****  *  **  ****  *  *  ***  *  *  **  ***  ******  *  **  ** 


TATA  1X2 

S'  GACTTGTCTC  CACACCTCTT  TTA — TAAGG  CGTGTTTCCC  AACCTTAAAA  ATTTGAACCT  TGAACAGACT  AGCATGTCTT  GGTCCCATTA  ATTCTCAACC  489 
S  TTCTTATTTC  CACGTGTGTG  ACCCTTCAGG  GTGGGTTCAT  TCTCTGGCAT  GTATGGATTA  CTTTTGGCCT  CACAGGAGTT  ATTCTCTTCC  TTCCTAATAC  1096 


** *  *  **  *** 


*  *★  **  *  **  **  *  *  *  **  *  * 


S'  1' 


S'  CGTCTAGACT  CCCTCTTTT-  -CTTC-CAGA  TTCCAAGATG  CCTTCCAGGC  ACTGCGAACC  CjJCACATGTG  AGCATCTCGC  CAGC-TCCCA  ACAATTTGGC  585 
S  'CCTCTTGACT  CTGTATTGTA  GCTTCTCATG  GCCCTAGTCA  GGAAACAGCT  ATTT-GATTC  CGGATTTAAT  AAACTATTAT  TAGAGTCTGA  GGAAGACAGT  1195 


*  ***  ****  *  *  **  *  ****  **  **  ** 


*******  *  *  *  *****  **  * 


S'  GAACCAACGC  CTGGG—AAA  GGCAGAGGAC  ATTTGGAAGA  AGCACTGCTG  GTTTGGAGCT  CCATGGAAGA  AGAAAAGAA-  TTAAAGAAAA  TTCGTACCGG  682 

S  GAGAGGTTGG  TTGATCCACA  TACTCTGTGC  TTGTGTGTGT  GTGTGTGTGT  GTGTACACAT  ACATGTATGT  TTCTTTTTGC  TGAAACAATG  TTAGTAATGT  1295 

**  *  **  *********  ********  ****  *  *  *  ***  **  **  ***  * 

I  S  '  2  ' 

S'  AGAAC/ATACT  GCTACCTGCT  GAGAGGCCTG  TCCTGCACGT  CATCACCTGC  AGCTGGTTCC  AGGCTCCAAG  GATGAATCGG  TGGGAATGGA  CCT-TCTCCC  781 

S  TGGGAAGCAA  TATGTATACA  TCAGAGAATA  TGCCTCA-GT  GACAAGTTTT  AGTTG-TTCA  ATGTTGGCAC  AGTAAAGAGG  GGGAAAAAGA  GGTGTGTCAG  1393 

*  *  *  *  *  *********  **  **  ***  *  *  *  *  ***********  *  *  ** 

I  S '  3  ' 

S'  CTTTTCTTTA  --TAAOGTTT  CCAGTTGATG  ATGTTGCCTA  GGGAGACATG  AATAGC—AC  AGCTGGACCA  GCAAGCAGAA  GCAGACACAG  CTTCAAATGC  877 

S  CAACCCTGGA  CTTTAGCTTC  TTGATTG--G  AAGTTACTAT  AGCAG-CAAG  ATTACTTGAC  AGATAGATCA  TTAACATTAA  GATCAAAGGC  CGGAAGTTAT  1490 

*  **  *  *  **  **  ***  *  *  ***  *  *  **  **  *  *  **  **  **  *  **  **  **  **  *  *  *  *  *  * 


52 

58 

59 

60 
61 
62 
63 
65 


S'  AGGATGAAGC  TCCTGAAGGC  TCCCAGGGAC  GGGATGTACA  T-GATGACA -  -TTTGGAG  932 

S  GATGTTTAAC  TTTTAAAAGT  CAGCTCTGCC  ATGACTGCCA  TH.GCTTGCAG  CTTTCGTCTG  GAAGCCACAG  TATGGATGCC  CGGCATCAGC  CTTGGTTTCT  1590 

*  * . *  *  *  *  **  1  ;x*  **  ***  * 

Fig.  4.  Comparison  of  Meplb  genomic  sequences  upstream  of  g'  and  g  isoforms.  The  sequences  were  aligned  using  Clustal  W  (Thompson  et  al.f 
1994).  The  g  sequence  is  numbered  the  same  as  in  Fig.  3.  For  the  g'  sequence,  the  genomic  part  (1-599)  is  derived  from  clone  PI,  and  the  cDNA 
part  (600-932)  is  derived  from  exon  gT-g'3'  corresponding  with  the  published  sequence  of  nucleotides  51-383  (Dietrich  et  al.,  1996).  Identical 
nucleotides  between  the  two  sequences  are  indicated  below  the  g  sequence  by  asterisks.  Potential  transcription  factor-binding  sites  for  the  g' 
sequence  are  underlined  and  indicated  above  the  g'  sequence.  The  sequences  found  in  the  cDNA  are  italicized,  and  the  translation  start  codons 
are  indicated  by  short  horizontal  bars.  The  start  of  each  exon  is  marked  by  vertical  bars.  The  sequence  of  clone  PI  has  been  deposited  in  GenBank 
with  Accession  No.  AFI 60983. 


413  either  new  sequences  (introns)  were  inserted  between 

414  the  p -specific  regions,  or  the  corresponding  sequences 

415  were  eliminated  from  the  5'  upstream  region  of  p. 

416  3.6.  Comparison  of  the  regulation  of  meprin  /?  expression 

417  in  mouse  and  human  cells 

418  To  compare  the  potential  tissue-specific  elements 

419  identified  for  the  mouse  and  human  meprin  P  genes,  the 

420  sequences  from  the  two  species  upstream  the  P  mRNA 

421  isoforms  were  aligned  (Fig.  5).  The  low  homology  is 

422  reflected  by  many  and  large  gaps  introduced  for  the 

423  optimal  alignment  of  the  two  sequences.  Furthermore, 

424  the  putative  intestine-specific  elements  indicated  for  the 

425  human  gene,  SI/cdx2  and  LPH/cdx2,  are  not  conserved 

426  in  the  mouse  gene,  and  the  putative  kidney-specific 

427  elements  identified  for  the  mouse  gene,  HFH3,  CREB 

428  and  GATA1  (Fig.  3),  are  not  found  in  the  human  gene. 

429  However,  several  short  motifs  (six  to  10  nucleotides)  are 


conserved  in  the  mouse  and  human  genes  (Fig.  5).  It  is  430 
possible  that  these  motifs  represent  novel  intestine-  and  431 
kidney-specific  elements  for  expression  of  the  meprin  P  432 
genes  in  the  two  species.  433 

The  factors  responsible  for  expression  the  meprin  P'  434 
mRNA  isoforms  in  human  and  mouse  cancer  cells  may  435 
be  different.  For  example,  an  API /PE  A3  site  implicated  436 
in  expression  of  human  P'  (Matters  and  Bond,  1999b)  437 
is  only  partially  conserved  in  the  mouse  gene  (Fig.  5).  438 
No  element  known  to  affect  gene  expression  in  cancer  439 
cells  was  identified  for  the  mouse  gene  in  the  500  bp  440 
sequence  upstream  of  the  P'  mRNA  isoform,  indicating  441 
either  that  elements  are  located  further  upstream  or  that  442 
there  are  novel  elements  for  expression  of  the  mouse  443 
gene  in  cancer  cells.  444 

The  results  herein  indicate  that  the  mechanisms  445 
responsible  for  expression  of  the  meprin  P  gene  in  cancer  446 
cells  from  mouse  {Meplb)  and  human  (MEP1B)  are  447 
different  (Fig.  6).  Alternative  splicing  is  involved  in  448 
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1/2  ERE 


99 


mepib  cgacctcaggtggtccacccacct-xgacctcccaaactgctgggattacacatgtgagccaccatggccaaaaattagtattcaaacaatattatcctg 

Heplb  AGAGGT-  -GTTGGCAGGGCCACATATGA  TCCCTGA-TGTTGTCTCTCCTCAGG - CATCCTTCG - AAAAT— TATTC  CTTTAGTrAC  968 

**  *  *  ***  ****  *  ***  ****  *****  * 

FEA3 

MEPIB  CMa^^^AATATGAAOTCATGGGAACTGAGAC^CAGGrrcTTTCTAAAGAGAAATTATCCCAAGAAAAGAaSS^GTG^GAW  199 

Mepib  CATCCTAGGTAGTCTCATTATGGA - AATA - TTCTTA  TTTCC  *****  *  * 

****  **  *..  *  *  ****  *  **  *  ****  *  *  *** 

8I/c4x2  1/2  ERE  1/2  ERE/APl 

MEPIB  TAAGAGATGGATTGGGATCATTATTmirAGAAAATCAGlSa^TTCCTACTTTTAISaCXATCTCCCACCAGTGTTTG-CTTGATGGGAATTTTTA  298 

Mepib  TCAGGG - TGGGTTCATTCTCT - GGCATGTATG-GAT - TACTTT- -TGGCC - TCACAGGAGTTATTCTCTTCCT- 

*  *  *  ****  *****  *  *  ******  ******  **  **  **  **  ***  ** 

SX/cdx2 

GGGATGAGATACATTGCTATGTTACTACTCTATATGTGAATAACCGGTGTGCCTGATTTC-TAGCAC-ATTAATGGATTTTATAACTCAAIAAATTATAG  396 

_ ACTCTGTAT _ TGTAGCT-  -TCTCATGGCCCTAGTCA-GGAA AC  AGCT  -  ATTTGATTC  C  1156 


-TCCTA  1092 


MEPIB 

Mepib 


-ATACCCT-CT-TG- 

****  *  **  ** 
SX/cdx2 


*  *  *  *  * 

SX/cdx2 


****** 


*  *  *  *** 


495 


MEPIB  CKTACTCAATAATTlJialTAAATGGGTGAATGaTaMAAGCAGTGGACATAACTGGATATATTAAAGCAGCATGGGGCTATATGAGAGGAGGAGTTTCA 

Mepib  GGATTTAATAAAjCTATTATTAGA-  -GTC - TGAGGAAGA-CAGTG  AGAGGTTGGTTGATCCACATACTCTGTGCTTGTGTGTGTGTGTGTGTGT  1245 

*  *  *  *****  *  ******  *  *  ***  **  *  *****  ********  *  ******** 

nkx2.5  LPH/cdac2 

MEPIB  TMGATCAAGATTAAAAATTAATTTGATATATATATTTrPTGCIIMSISGTGTITA£TAGTCTAAGGAC-CAGTACACATACATTAGAGAATGTAT-TC  593 

_ n'TYTTArarATAC'AT - GTATGT-TTCTTTTTGCTGAAACAATGTT-AGTAATGTTGGGAAGCAATATGTATACATCAGAGAATATGCCTC  1331 

p  .  .....  .....  ....  . . ..  ***•  *****  ***  **  **  ******  *******  * 

nkx2.5  AP1/PEA3  0SE2  TATA 

MEPIB  ATTAAGAAGTCTCAGCCAGGCACIXaaPgTISagflCflGfiAflaAAAAAAAGCCAaCM&gg£a£TCCGCAGAGCTGTGITAA5agiaCTGGACTTTCATC  693 

Mepib  AGTGACAAGTTTTAGTTG . TTCAATGTTGGCACAGTAAAGAG - GGGGAAAAA - GAGGTGTGTCAGCAACCCTGGACSgAGC 

r~ — ►  TATA 

MEPIB  CCTTGATTGGAGGTATTATTACAGCATGAAGCTGACCTGCAATATGGATCATTGA - GACCAAAGGCimMTTATAATGTTTGCCTTTAAAGAA  787 

MeDlb  TCTTGATTGGAAGT -  -TACTATAGCAGCAAGATTACTTGACAGATAGATCATTAACATTAAGATCAAAGGCCGGAAGTTATGATGTTTAACTTTTAA  ££  14 

T777777777  ..  ..  **  ****  ****  *  *♦  ••  *  **  *******  *  **  *******  **  ****  ******  ****  ** 


MEPIB  GTCAA^I^^CCCTGAATGTCATGGTTAGCTACTTTCAACTGGAAGCTACAACATGGATTTATGGAATCTGTCTTGGTTTCTGTTCTTGGATGCTCTTCTCG  889 

2111  G^AGCTCT^CATGACTGCCATAGCTTGCAGCTTTCGTCTGGAAGCCACAGTATGGATGCCCGGCATCAGCCTTGGTTTCTGGTTTTTGCCACATTTCTCC  — 

T7r*  **  . **l  * ;  **  . ****** ***  ******  **  *“  * *********** 


1600 


74  Fig.  5.  Comparison  of  human  MEPIB  and  mouse  Mepib  sequences  upstream  of  the  P  mRNA  isoforms.  The  potential  transcription  factor-binding 

75  sites  identified  for  the  MEPIB  gene  by  Matters  and  Bond  ( 1999a)  are  underlined  and  the  factors  indicated.  Short  sequences  of  six  to  10  nucleotides, 

76  and  the  start  codon  for  translation,  conserved  in  the  two  genes  are  indicated  by  horizontal  bars.  The  starts  of  different  mRNA  isoforms  are 
78  indicated  by  arrows.  SI,  sucrase-isomaltase;  LPH,  lactase  phlorizin  hydrolase. 


87  Fig.  6.  Proposed  mechanisms  for  expression  of  the  meprin  p  gene  in  mouse  and  human  cells.  Splicing  of  exons  pi  and  p2  is  involved  in  expression 

88  of  the  mouse  Mepib  gene,  not  in  the  human  MEPIB  gene,  in  cancer  cells. 
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expression  of  Meplb  but  not  MEP1B.  The  promoter 
(P'J^)  for  directing  the  expression  of  the  P'  isoform  in 
mouse  cancer  cells  is  predicted  to  be  several  kb  away 
from  the  promoter  (P^)  for  the  p  isoform  expressed  in 
normal  cells.  In  contrast,  the  two  human  promoters  are 
predicted  to  be  proximal  to  each  other.  In  addition,  the 
human  proteins  produced  in  cancer  and  normal  cells 
are  the  same,  whereas  the  mouse  P'  protein  expressed  in 
cancer  cells  encodes  a  signal  peptide  and  part  of  the 
prosequence  that  differ  from  the  P  protein  produced  in 
normal  cells. 


460  3. 7.  Conclusions 

461  1.  The  mouse  Meplb  gene  spans  approximately  40  kb 

462  on  chromosome  18  and  consists  of  18  exons  and  17 

463  introns.  The  first  three  exons  compose  the  unique  5' 

464  end  of  P'  mRNA  found  in  mouse  cancer  cells;  the 

465  next  two  exons  correspond  with  the  5'  end  of  P 

466  mRNA  expressed  in  kidney  and  intestinal  cells.  The 

467  mouse  genomic  Southern  confirms  the  location  of 

468  the  P'  exons  in  the  mouse  DNA.  The  rest  of  the 

469  exons  (13  total)  encode  the  regions  common  to  both 

470  P  and  P'  messages.  In  conjunction  with  the  cDNA 

471  sequences,  the  Meplb  gene  structure  establishes  that 

472  alternative  splicing  of  5'  exons  is  responsible  for  the 

473  generation  of  the  two  mouse  mRNA  isoforms. 

474  2.  Potential  promoters  and  transcription  factor-binding 

475  sites  were  identified  in  the  upstream  regions  of  the  P- 

476  and  P'-specific  exons.  The  two  regions  showed  a 

477  significant  homology,  indicating  that  a  duplication 

478  event  occurred  during  evolution  of  the  Meplb  gene. 

479  However,  the  potential  regulatory  elements  were  not 

480  conserved,  suggesting  the  differential  regulation  of 

481  the  gene  in  normal  and  cancer  cells. 

482  3.  A  major  species  difference  exists  in  expression  of  the 

483  meprin  p  and  p'  isoforms  between  mouse  and  human 

484  in  normal  and  cancer  cells.  Alternative  splicing  of  5' 

485  exons  occurs  in  the  mouse  gene  expression,  whereas 

486  there  is  no  evidence  of  any  involvement  of  alternative 

488  splicing  in  the  human  gene  expression. 
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SUBSTRATE  SPECIFICITY  OF  THE  MOUSE  KIDNEY  METALLOENDOPEPTIDASES, 
MEPRINS  A  AND  B.  Greg  P.  Bertenshaw.  Gail  L.  Matters,  John  Bylander  and  Judith  S. 
Bond.  Department  of  Biochemistry  and  Molecular  Biology,  Pennsylvania  State  University, 
College  of  Medicine,  Hershey,  PA  17033,  USA. 

Meprins  A  and  B  are  zinc-dependent  metalloendopeptidases  of  the  astacin  family  and  metzincin 
superfamily.  They  consist  of  evolutionarily  related  a  and/or  (3  subunits.  A  high  level  of 
expression  of  these  secreted  or  membrane-bound  ectoenzymes  is  seen  in  the  brush  border 
membranes  of  intestine  and  kidney  proximal  tubules.  Meprins  are  capable  of  degrading  a 
variety  of  peptides  and  proteins  in  vitro.  Meprin  A  is  capable  of  hydrolyzing  gelatin,  basement 
membrane  proteins,  insulin  B  chain  and  numerous  peptides  including  bradykinin,  substance  P 
and  angiotensins.  Meprin  B  cleaves  protein  kinase  A,  insulin  B  chain  and  the  peptides  such  as 
gastrin.  In  order  to  identify  possible  physiological  substrates  of  meprin  additional 
gastrointestinal  peptides  were  examined  for  hydrolysis  by  the  meprins.  We  found  that  meprins 
are  able  to  degrade  gastrin-releasing  peptide,  glucagon,  secretin,  cholecystokinin,  peptide  YY, 
vasoactive  intestinal  peptide  and  orcokinin.  Some  peptides  (e.g.  CCK)  are  susceptible  to  both 
meprin  A  and  B,  however,  cleavage  occurs  at  different  peptide  bonds.  The  data  indicate  that 
although  the  a  and  (3  subunits  of  meprins  are  very  similar  with  respect  to  amino  acid  sequence 
(58%  amino  acid  identity  within  the  protease  domain),  they  have  very  different  substrate 
specificities.  In  this  study  we  show  that  meprin  A  has  a  rather  broad  specificity,  in  contrast 
meprin  B  is  much  more  specific  with  preference  for  aspartate  or  glutamate  residues  at  the  PI’ 
site.  We  conclude  that  Meprin  B  is  predominantly  an  acidic-N  endopeptidase.  This  study 
focuses  on  a  kinetic  comparison  of  meprins  A  and  B  using  established  and  newly  identified 
peptide  substrates.  Here  we  report  cleavage  sites  as  well  as  kcat  and  Km  values  for  meprins 
against  each  peptide.  Furthermore  we  address  the  role  of  the  individual  subunits  in  the 
degradation  of  extracellular  membrane  proteins  by  meprin  A.  This  work  is  fundamental  to  the 
development  of  specific  inhibitors  to  the  individual  subunits  to  further  delineate  the  function  of 
meprins  in  vivo. 


Substrate  Specificity  of  the  Mouse  Kidney  Metalloendopeptidases 

Meprins  A  and  B 

Greg  P.  Bertenshaw.  Gail  L.  Matters  John  Bylander  and  Judith  S.  Bond 
Pennsylvania  State  University,  College  of  Medicine,  Hershey,  Pennsylvania,  USA 

Meprins  A  and  B  are  zinc  dependent  metalloendopeptidases  of  the  astacin  family  and 
metzincin  superfamily.  They  consist  of  evolutionarily  related  a  and  /  or  (3  subunits.  A 
high  level  of  expression  of  these  secreted  or  membrane-bound  ectoenzymes  is  seen  in 
intestine  and  kidney  proximal  tubules.  There  is  evidence  that  implicates  meprins  in  the 
susceptibility  to  renal  disease;  i.e.,  recent  segregation  and  linkage  analyses  showed  the 
meprin  p  gene  to  have  major  effects  on  the  prevalence  of  diabetic  nephropathy  in  Pima 
Indians.  Meprins  are  capable  of  degrading  a  variety  of  peptides  and  proteins  in  vitro.  The 
study  presented  here  is  directed  towards  the  identification  of  additional  physiological 
substrates  that  are  susceptible  to  meprins.  Meprin  A  is  capable  of  hydrolyzing  gelatin, 
basement  membrane  proteins,  insulin  B  chain  and  numerous  peptides  including 
bradykinin,  substance  P  and  angiotensins.  Meprin  B  cleaves  protein  kinase  A,  insulin  B 
chain  and  the  peptides  gastrin,  cholecystokinin  and  substance  P  among  others.  These  data 
indicate  that  although  the  a  and  p  subunits  of  meprins  are  very  similar  with  respect  to 
amino  acid  sequence  (58%  amino  acid  identity  within  the  protease  domain)  they  have 
very  different  substrate  specificity’s.  This  study  focuses  on  a  kinetic  comparison  of 
meprins  A  and  B  using  established  and  newly  identified  substrates.  This  work  will  allow 
for  the  development  of  specific  inhibitors  to  the  individual  subunits  to  further  delineate 
the  function  of  meprins  in  vivo. 
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